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Abstract 



Since the pioneering work of Gerhard Griiss dating back to 1935, Grass's inequality and, 
more generally, Grass-type bounds for covariances have fascinated researchers and found 
numerous applications in areas such as economics, insurance, reliability, and, more generally, 
decision making under uncertainly. Gruss-type bounds for covariances have been established 
mainly under most general dependence structures, meaning no restrictions on the dependence 
structure between the two underlying random variables. Recent work in the area has revealed 
a potential for improving Gruss-type bounds, including the original Grass's bound, assuming 
dependence structures such as quadrant dependence (QD). In this paper we demonstrate 
that the relatively little explored notion of 'quadrant dependence in expectation' (QDE) 
is ideally suited in the context of bounding covariances, especially those that appear in 
the aforementioned areas of application. We explore this research avenue in detail, establish 
general Gruss-type bounds, and illustrate them with newly constructed examples of bivariate 
distributions, which are not QD but, nevertheless, are QDE. The examples rely on specially 
devised copulas. We supplement the examples with results concerning general copulas and 
their convex combinations. In the process of deriving Gruss-type bounds, we also establish 
new bounds for central moments, whose optimality is demonstrated. 

Keywords and phrases: Grass's inequality, covariance bound, Hoeffding representation, 
Cuadras representation, quadrant dependence, quadrant dependence in expectation, cop- 
ula, convex combination, Archimedean copula, Frechet copula, Farlie-Gumbel-Morgenstern 
copula, central moments, Edmundson-Madansky bound. 



1 Introduction 



The covariance, say Cov[V, W] between two random variables V and W, has played pivotal 
roles in numerous areas such as economics, finance, insurance, statistics, and, more generally, 
in decision making under uncertainty. For details on specific applications with references to 
many works in the areas that have greatly influenced our current research, we refer to Broil 
et al. [1], Egozcue et al. [9], [10], Furman and Zitikis [H], [15], [16], Zitikis [38]. A number 
of mathematics problems, especially those related to the theory of functions, have also been 
successfully tackled with the aid of covariance- type considerations (see, e.g., Dragomir and 
Agarwal [5], Dragomir and Diamond [6], Furman and Zitikis [12], [13], Izumino and Pecaric 
[25], Izumino et al. |26j). Solutions to problems in these areas often rely on determining the 
sign of covariances as well as on establishing their lower and upper bounds. 

The random variables V and W are often unobservable but are known to be transfor- 
mations (also called distortions) of some observable random variables X and Y; that is, 
V = a(X) and W = /3(Y) for some functions a, /3 : R — > R. Consequently, the covariance 
Cov[a(X) , (3(Y)} becomes of interest. In a large number of applications, only one of the 
two random variables is distorted. In this paper we concentrate on this case, thus restricting 
ourselves to an in-depth analysis of the covariance 



If compared to the more general covariance Cov[a(X), 0(Y)], this reduction of generality 
plays a significant role in providing us with additional technical tools, including the notion 
of 'quadrant dependence in expectation' (QDE) to be defined in Section [3] below, and thus 
in turn allows us to establish deeper results than those available in the literature under, say, 
the notion of quadrant dependence (QD). In applications where covariance fll.ip emerges, 
the distortion function (3 might be, for example, a utility or value function (see, e.g., Broil 
et al. [I], Egozcue et al. [H], [ID], and references therein), some insurance-premium loading 
function (see, e.g., Furman and Zitikis [H], [T5], [TB]; Sendov et al. [34] . and references 
therein) . 

When estimating covariance fll.ip . perhaps most naturally that comes first into our mind 
is the Cauchy-Schwarz inequality 



where Var[X] is the variance (i.e., Cov[X, X]) of the random variable X. Furthermore, 
assuming that there are finite intervals [a, A] and [b, B] such that X e [a, A] and (3(Y) e [b, B] 
almost surely, from bound f ll.2p we immediately obtain Griiss's [23] inequality 



Cav[X,P(Y)]. 



(1.1) 




(1.2) 



Cav[X,P(Y))\ < 



(A-a)(B-b) 



(1.3) 



4 



(see, e.g., Zitikis [3B] for details and references). Inequalities ( II. 2p and (II. 3p hold irrespec- 
tively of the dependence structure between X and (3{Y), which implies that the inequali- 
ties also hold under the 'worst possible' dependence scenario, which is associated with the 
strongest dependence structure between X and f3(Y), arising when X = (3(Y) almost surely. 
It is under this scenario that the optimality of the Griiss's bound has been established in 
the literature, and we refer to, e.g., Dragomir 11], [7], Mitrinovic et al. [31], Steele [35], and 
Zitikis [38] for further notes, examples, and references on the topic. 

When the random variables X and /3(Y) are independent, which in particular happens 
when the underlying random variables X and Y are such, then the covariance Cov[X, (3(Y)] 
is zero. Hence, knowing how much and in what sense the random variables X and Y 
are dependent plays a significant role when investigating the magnitude of the covariance 
CovLY, (3(Y)] and its sign, among other properties. This line of research has been advocated 
by Zitikis [38] and Egozcue et al. [9], who have employed the notion of quadrant dependence 
to be defined rigorously in Section [3] below. 

We conclude this section with a guide through the rest of this paper. In Section [2} we first 
show how the assumption of bivariate normality leads, via the well-known Stein's Lemma, 
towards a Gruss-type covariance bound. We then extend this bivariate normal case into the 
formulation of a general Gruss-type covariance bound, which we aim at establishing in various 
situations throughout the current paper. In Section EJ we recall definitions of QD and QDE 
and their counterparts for copulas, and also relate these notions of dependence to Griiss- 
type covariance bounds. In Section [3] we also establish general results concerning convex 
mixtures of negative quadrant dependent (NQD) and positive quadrant dependent (PQD) 
copulas that provide a basis for constructing bivariate distributions which are QDE but not 
QD. We devote Section H] to constructing several illustrative examples of copulas which are 
QDE but not QD; as far as we are aware of, these examples are the first ones in the literature. 
In Section |5l we establish QDE-based Gruss-type bounds for covariance ( ll.ip . discuss their 
optimality and highlight the importance of having tight bounds for central moments of 
random variables. We investigate the latter bounds in great detail in Section El Since the 
QDE notion of dependence also naturally leads towards regression-based considerations, in 
Section [7J we establish regression-based Gruss-type bounds for covariance (II. ip . 

2 Formulation of the problem 

Applications often suggest models for (X, Y) but it may not be feasible to assume models 
for the pair (X,(3(Y)) because the distortion function (3 may change depending on, say, 
investor, insurer, etc. For this reason it is desirable to separate the underlying stochastic 
model, which is based on (X, Y), from the class of distortion functions j3. 



Stein J3S| noted that if the pair (X, Y) follows the bivariate normal distribution and the 
function (3 is differentiate, then 



Cav[X,p(X)] = Cov[X,Y]E[f3'(Y)}. (2.1) 

This equation, frequently known as Stein's Lemma, separates the dependence structure 
from the distortion function /?. For extensions and generalizations of this result, we refer to 
Furman and Zitikis [H], [TS], [TB], [17], and references therein. In particular, it has been 
observed that equation (12. ip is a direct consequence of the following one 

CovLY, (3(Y)} = C °^ y [ ] Cov[F, /3(F)], (2.2) 

which separates the dependence structure of (X, Y) from the distortion function /3 but does 
not require the differentiability of 0. 

Now we rewrite equation (12. 2p in the form 

Cov[X,/3(Y)] = Corr[X,F]G [X,F,/3], (2.3) 

which we call to be of the 'Griiss form' for reasons to be made clear below (Problem 12.11 
below), where Corr[X, Y] is the Pearson correlation coefficient between X and Y, and 
Gq[X,Y, j3] is a 'Griiss factor' defined by 



Go[x,y^] = ^^Scov[y,/3(y)]. 

Note that the Griiss factor Gq[X,Y, /3] does not depend on the bivariate distribution of 
(X, Y) except that it depends on the cumulative distribution functions (cdf ) F and G of the 
underlying random variables X and Y, respectively, and also on the distortion function /?. By 
the Cauchy-Schwarz inequality, |Go[X, Y, 0]\ does not exceed the product of the standard 
deviations ^/\ar\X] and ^/Var[/3(y)], which do not exceed (A — a)/2 and (B — b)/2, 
respectively, under what we call the 'Griiss condition': 

• There are two finite intervals [a, A] C R and [b, B] C R such that X G [a, A] and 
fi(Y) G [b, B] almost surely. 

Hence, under the Griiss condition, we have that Go [X, Y, f3] does not exceed the right-hand 
side of bound (11 .3p , and we thus have that 

| CovLY, /3(F)] | < |Corr[X,F]| {A - a)< f - b) . (2.4) 

Since |Corr[X, Y]\ does not exceed 1, bound (12 .4p implies Griiss's bound (11.31) irrespectively 
of the dependence structure between X and Y. When these two random variables are 



independent, then the right-hand side of bound (11.31) is zero. This demonstrates the pivotal 
role of the dependence structure when sharpening Griiss's bound. 

Reflecting upon the notes above, we next put forward a general formulation of the problem 
that we shall tackle from various angles throughout this paper. 

Problem 2.1 We are interested in establishing bounds of the form 

\Cav[X,P(Y)]\ < D[X,Y]G[X,Y,(3], (2.5) 

where 

• ~D[X,Y] is a 'dependence coefficient', which must be equal to when the random vari- 
ables X and Y are independent, and should not depend on the distortion function j3; 

• G[X,Y,(3] is a 'Griiss factor', which should not depend on the dependence structure 
between X and Y but may depend on (3 and the cdf's F and G of X and Y , respectively. 

Throughout the paper we assume that the distortion function (3 : R — > R is of bounded 
variation, meaning that it can be written as the difference (3 = Pi — (3 2 of two non-decreasing 
functions 0i , (3 2 ■ R — > R- The corresponding function \(3\ : R — > R is defined by the equation 
\(3\(y) = fiiiy) + 02(y)- When (3 is different iable, then d\/3\(y) = \(3'(y)\dy. Furthermore, we 
use 1{S} for the indicator function of statement S which is equal to 1 when the statement 
S is true and otherwise. Hence, in particular, for any random variable Z and any real 
number z, 

T Z {Z) = 1{Z > z} 

is a random variable that takes on the value 1 when Z > z and otherwise. We shall 
frequently view t z (Z) as a random function of z. In our following considerations, we shall 
also use the sign- function, sign(x), which takes on three values: —1 when x < 0, when 
x = 0, and +1 when x > 0. 



3 QD and QDE random variables and copulas 

One of the most fundamental equations that we utilize in the present paper is the Cuadras- 
Hoeffding representation 

Cav[a(X),P(Y)] = J J Cov[r x (X), r y (Y)] da(x) d/3(y) (3.1) 

of the covariance between the transformed random variables a(X) and /3(Y). The repre- 
sentation has been established by Cuadras [2] assuming, naturally and necessarily, that the 
expectations of a(X), (3(Y), and a(X)(3(Y) are well-defined and finite. Covariance represen- 
tation (13.1 1) generalizes the classical Hoeffding's [21] representation established in the case 



a(x) = x and (3(x) = x (see also Sen [S3])- The importance of representation (13. If) in our 
context is that it achieves a separation of the dependence structure present in (X, Y) from the 
distortion functions a and j3. Hence, in particular, the positive quadrant-dependence (defi- 
nition follows) implies that Cov[X, Y] > 0, and the negative quadrant-dependence implies 
that Cov[X, Y] < 0. These are, of course, well-known facts (Lehmann [28]). 

Definition 3.1 (Lehmann f2$[) Two random variables X and Y are positively (resp. neg- 
atively) quadrant dependent if Cov\r x (X),T y (Y)\ > (resp. < 0) for all x, y G R. We 
abbreviate this as PQD (resp. NQD), and when it is not important to specify whether the 
two random variables are PQD or NQD, then we simply say that they are quadrant dependent 
(QD). 

As a special case of representation (13 .ip we have the following one: 

Cav[X,P{Y)\ = J J Cov[T x (X),r y (Y)]dxdp(y). (3.2) 

Note that the inner integral on the right-hand side of equation (I3.2p is equal to Cov [X, T y (Y)\ , 
and so representation ( 13.2ft becomes 

Cov[X,p(y)] = J Cov[X, r y (Y)]dP(y). (3.3) 

The integrand on the right-hand side of equation ( 13.31) is related to the following definition. 



Definition 3.2 (Kowalczyk and Pleszczynska f27\l ) A random variable X is positively (resp. 
negatively) quadrant dependent in expectation on a random variable Y if Cov\X, r y (y)] > 
(resp. < 0) for all y e R. We abbreviate this as X is PQDE (resp. NQDE) on Y , and when 
it is not important to specify whether these two random variables are PQDE or NQDE, then 
we simply say that X is quadrant dependent in expectation (QDE) on Y. 

QDE is not a stronger notion than QD, which follows from the already noted but not 
explicitly written equation: 

Cov[X,r„(y)] = J Cov[r x (X),r y (Y)]dx. (3.4) 

For discussions and hints on potential applications of this notion of dependence, we refer to 
Kowalczyk and Pleszczynska [27], Wright [37J, and references therein. One would actually 
expect that QDE is a weaker notion than QD, which means that there must be pairs (X, Y) 
which are QDE (i.e., either NQDE or PQDE) but not QD (i.e, neither NQD nor PQD). Our 
search of the literature has not, however, revealed examples that would formally confirm 
this non-equivalence of QDE and QD. Hence, we next present general results pointing in 
the direction of non-equivalence, and we shall use them in Section H] as our guide when 
constructing specific examples of bivariate distributions that are QDE but not QD. 



The main tool that we are going to employ is the notion of copula, which is a surface 
(u, v) (-> C(u, v) defined on the square [0, 1] x [0, 1] and such that P[X < x, Y < y] is equal to 
C(F(x), G(y)), where F and G are the cdf's of X and Y, respectively. Hence, in particular, 
we have the equation 

Cov[X, r y (Y)] = J (c(F(x),G(y))-F(x)G(y)yx. (3.5) 

When the random variables X and Y have uniform (marginal) distributions, then we denote 
them by U and V, respectively. In turn, we have the following reformulations of Definitions 
13.11 and 13.21 in terms of the copula C, which is connected to the bivariate distribution of 
(U, V) via the equation 

P[U <u,V < v] = C(u 7 v). 

Namely, U and V are PQD (resp. NQD) if C(u, v) > uv (resp. C(u,v) < uv) for all 
u,v G [0,1], and U is PQDE (resp. NQDE) on V if C(v) > (resp. C{v) < 0) for all 
v G [0, 1], where C(v) = Cov[U,r v (V)], that is (cf. equation (13.51) ). 

C( v ) = / (G(u,v) — uv)du. 
Jo 

In general, we have the following QD and QDE definitions for copulas. 

Definition 3.3 Copula (u,v) h- > C(u,v) is PQD (resp. NQD) if C(u,v) > uv (resp. 
C(u, v) < uv) for all u, v G [0, 1] . The copula is QD if it is either NQD or PQD. 

Definition 3.4 Copula (u,v) i-> C(u,v) is PQDE (resp. NQDE) if C(v) > (resp. < 0) 
for all v G [0, 1]. The copula is QDE if it is either NQDE or PQDE. 

Note 3.1 In Definition 13.41 it would be more precise to say that U is PQDE (resp. NQDE) 
on V if C{v) > (resp. < 0) for all v G [0, 1]. Analogously, U is QDE on V if U is either 
NQDE or PQDE on V. We avoid this pedantry by always considering the 'first variable' to 
be (N/P)QDE on the 'second variable'. 

Hence, the problem that we are interested in at the moment is whether there are any 
copulas that are QDE (i.e., either NQDE or PQDE) but not QD (i.e, neither NQD nor 
PQD). The following two general theorems are fundamental in solving this problem, with 
illustrative examples provided in Section HI 

Theorem 3.1 Let Cq(u,v) and C\(u,v) be NQD and PQD copulas, respectively. Denote 
their convex combination byC a (u,v) = (1 — a)C (u, v) + aCi(u, v) with parameter a G [0,1]. 
Suppose that the surface 

K v) ^ 777 r — r 3.6 

is not constant on [0, 1] x [0, 1] . Then there exist m, m', M', M G [0, 1] such that < m < m! , 
M' < M < 1, and m < M, and such that the copula C a is: 



• NQDfora E [0,m]/ 

• neither NQD nor PQD for a E (m, M); 

• PQD for a E [M, 1]; 

• NQDE if and only if a G [0,m'] frf couZd 6e t/jat m = m',); 

• neither NQDE nor PQDE for a G (m', M') (zi coit/d fre i/iai m' > M', m which case 
the interval (m',M') is empty); 

• PQDE if and only if a G [M' , 1] couZd 6e that M = M'). 

Proof. Let I_ = {a E [0, 1] : C a is NQD} and 7+ = {a E [0, 1] : C a is PQD}. We have 
the following facts: 

1. OeL and 1 G I+. 

2. 7_ (similarly J + ) is a closed subspace of [0, 1]. Namely, if C ak (u,v) — uv < for all 
u,v E [0, 1] and — )■ a, then C a (tt, v) — uv < for all u,v E [0, 1]. 

3. 7_ (similarly 7 + ) is a connected space. Namely, if a, /3 E 7_, then C 7 for any 7 G [a, /3] 
is a convex combination of C a and C/3, and so it is NQD. 

4. 7_ and 7 + are closed intervals (it follows from 2 and 3). 

5. I fl 7 + = 0. We prove this by contradiction. Suppose that there exists a G 7_ fi 7 + . 
Then C a is NQD and PQD. This implies that (1 — a)Co(u, v) + aC\(u, v) — uv = for 
all u,v E [0, 1]. Hence, function (13. 6p is equal to the constant a; a contradiction. 

In view of the above facts we have that 7_ = [0,m] and 7 + = [M, 1] with m < M, and the 
first three statements of Theorem 13.11 follow. In a similar way, but working with the function 



we establish the other three statements of Theorem 13.11 Note that NQD (PQD) implies 
NQDE (PQDE), and so we must have m < m' and M' < M . This completes the proof of 
Theorem 13.11 ■ 

Note 3.2 If we have m < m', then there are a values such that the copula C a is neither 
NQD nor PQD, but it is NQDE. Similarly, if we have M' < M, then there are a values such 
that the copula C a is neither NQD nor PQD, but it is PQDE. 




(3.7) 



Theorem 3.2 Let the assumptions of Theorem lff.il be satisfied, and let C a (v) be given by 



equation \3. 7| ). Furthermore, assume that there is a constant k G [0, 1] such that 

Cq(v) 
C {v)-Cx{v) 

for all v G (0, 1). Then there is an open interval of a values such that the copula C a is 
neither NQD nor PQD, but it is either NQDE or PQDE. 

Proof. We have that C K (v) = for all v G [0, 1]. Thus, the copula C K is both PQDE and 
NQDE. By the previous theorem, we have that M' < K < m! . Since m < m/, M' < M and 
m < M, we deduce that m < ml or M' < M. Consider these two cases separately: 1) if 
m < m', then for any a G (m, m'] the copula C a is neither NQD nor PQD, but it is NQDE, 
and 2) if M' < M, then for any a G [M', M) the copula C a is neither NQD nor PQD, but 
it is PQDE. This completes the proof of Theorem | 



4 Examples of QDE copulas which are not QD 

Here we give three examples of QDE copulas that are not QD. In the first two examples 
we choose NQD and PQD copulas such that their convex combinations are not QD but, 
nevertheless, are QDE. The third example is based on a copula which is not QD but, under 
an appropriate choice of marginal distributions, produces a bivariate distribution that is not 
QD but, nevertheless, is QDE. These three examples open up broad avenues for constructing 
QDE copulas that are not QD, using a myriad of existing copulas whose QD-type properties 
have been documented in the literature (e.g., Nelsen [32] )• For discussions concerning copulas 
in the context of actuarial, financial, and other applications, we refer to, for example, Denuit 
et al. [3], Genest and Favre [18], Genest et al. [19], McNeil et al. [29], and references therein. 

Example 4.1 The Frechet lower-bound (FL) copula 

Cfl(u, v) = max{0, u + v — 1} 

is NQD, and the Frechet upper-bound (FU) copula 

Cfu(u,v) = mm{u,v} 

is PQD. Both are defined on the unit square [0, 1] x [0, 1]. Let C a be the convex combination 
of the two Frechet copulas (cf. McNeil et al. [29]): 

C a (u, v) = (1 - a)C FL (u, v) + aC FU (u, v), (4.1) 

where a G (0, 1). We see from Figure I4TT1 that the copula C a is neither PQD nor NQD. To 
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Figure 4.1: The surface (u, v) h» sign[C a (M, v) — uv] when a = 1/4 (left top panel), a = 1/2 
(center top panel), and a = 3/4 (right top panel) with the corresponding contour plots 
beneath them. 

check whether C a is QDE (i.e., either PQDE or NQDE), we calculate the integral 




Hence, C a (v) < for all v G [0, 1] if and only if a < 1/2, meaning that the copula C a is 
NQDE. Likewise, CJv) > for all v G [0,1] if and only if a > 1/2, meaning that C a is 
PQDE. Hence, for example, when a = 1/4, then C a is neither PQD nor NQD but it is 
NQDE. Likewise, when a = 3/4, then C a is neither PQD nor NQD but it is PQDE. This 
concludes Example 14. 1[ 

Example 4.2 Here we first choose the Farlie-Gumbel-Morgenstern (FGM) copula 

C FGM (u,v) = uv(l + 6(1 - u)(l - v)) 

with 9 G [—1,1]; we set the parameter 9 to — 1 throughout this example to make the FGM 
copula NQD. Next we choose the already noted Frechet upper-bound copula Cpu(u,v) = 
min{n, v}, which is PQD. Let a G (0, 1) be a parameter, and let C a be the convex combina- 
tion of the above two copulas: 



C a (u, v) = (1 - a)C FGM (u, v) + aC FU (u, v). 



(4.3) 



We have that 



C a (u,v) — uv = a(min{w,p} — uv) — (1 — a)uv(l — u)(l — v) 

u(l — v) (l — (1 — a)(l + v(l — it))) when u < v, 
v(l — u) (l — (1 — ol)(1 + u(l — v))) when u>v. 



(4.4) 



Hence, C Q (it, u) — uv > for only those (it, u) G [0, 1] x [0, 1] that are between (cf. equation 
(14.4jl ) the zero-curve 



?7 a = {(M,t;): (l-a)(l + u(l-u)) = l} 
from above, and the zero-curve 

L a = {(u,v): (l-a)(l + u(l-u)) = l} 



that is, f = 



a 



1 — it 1 — a 



that is, v = vl(u) = 1 



1 a 



u 1 — a 

from below. We illustrate the two curves in Figure 14.21 Note that the curves intersect in the 




Figure 4.2: The surface (u,v) h> sign[C a (u,v) — uv] when a = 1/6 (left top panel), a = 1/5 
(center top panel), and a = 1/4 (right top panel) with the zero-curves U a and L a in the 
corresponding plots beneath them. 

interior of the square [0, 1] x [0, 1] only when a G (0, 1/5) and touch each other at one point 
when a = 1/5. As to the PQDE or NQDE, we calculate the integral 



CM 



(r ( \ U "(1-^ 4 1 
{C a {u, v) — uvjau = I — a — 



(4.5) 



Hence, C a (v) < for all v G (0, 1) meaning that C a is NQDE if and only if a < 1/4, and 
Ca(v) > for all v G (0, 1) meaning that C a is PQDE if and only if a > 1/4. This concludes 
Example [4721 

Example 4.3 We model the pair (X, Y) using the following Archimedean copula (Genest 
and MacKay [2Tj, [22]; Genest and Ghoudi [20]; see also Nelsen [32] for additional information 
and references) 

C a (u, v) = max jo, 1 - ((1 - u a ) l ' a + (1 - u a ) 1 /«) at } 1 ° , 

where a G (0,1) is parameter. The copula C a (u, v) is not QD. The zero-curve Z a (u,v) = 
C a (u, v) — uv = 0, which separates the NQD region from the PQD region, is given by 

Z a = {(u,v) : (1 - u a ) 1/a + (I - v a ) 1/a = (1 - u a v a ) 1/a } , 

depicted in Figure 14.31 Concerning the QDE property, we want to know if and when the 
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Figure 4.3: The surface (u,v) H- sign[C Q (w, v) — uv] when a = 5/10 (left top panel), a = 
7/10 (center top panel), and a = 9/10 (right top panel) with the zero-curve Z a in the 
corresponding plots beneath them. 

function C a defined by the formula 

C a (v) = / (C a (u,v) — uv)du 
Jo 



Figure 4.4: The function v i— > C a {v) when a = 5/10 (left panel), a = 7/10 (center panel), 
and a = 9/10 (right panel). In every case, there are regions (not necessarily clearly visible 
in the graphs) where the function is strictly positive and strictly negative. 

is positive or negative. Our experimental analysis has revealed that this function is negative 
for some v G [0,1] and positive for other v, and this applies to every a G (0,)]. For an 
illustration, we have produced Figure 14.41 This implies that for the purpose of constructing 
QDE pairs of random variables we cannot choose both marginal cdf 's uniform. Hence, we 
choose only the first marginal cdf (i.e., that of X) uniform. Since the second marginal cdf 
(i.e., that of Y) cannot be uniform, nor any other continuous cdf, we construct a discrete cdf 
G or, equivalently, a discrete random variable Y in such a way that U(= X) is PQDE on Y . 
First we note that, for every v G (0, 1), 

• C a (u,v) — uv is positive for some u G (0, 1) and negative for some other u G (0, 1), 
thus violating the QD property. 

To verify this non-QD property rigorously, we first rewrite the copula C a (u, v) as max{0, P(u, v)} 1 ^ 
with the notation 

P(u, v) = 1 - ((1 - u a ) 1/a + (1 - v a ) 1/a ) a . 

For every fixed v and when u — 0, then we have P(0, v) — 1 — (1 + (1 — v a ) l l a ) a < 0. Thus, 
C a (u, v) = in a neighbourhood of u = 0. From this we deduce that C a (u, v ) — uv < when 
u G (0, e) for some e > 0. In a neighbourhood of u — 1, we have that C a (u, v) — uv is equal to 
P(u,vY^ a — uv, which is a differentiable function. The partial derivative (d/du)(P(u,vY^ a — 
uv) at the point u — 1 is negative. Hence, the function u i— >■ P(u, v) l l a — uv is decreasing in a 
neghbourhood of 1. When u — 1, then we have P(l, v ) l ^ a = v and thus P(l, v) l l a — 1 • v — 0. 
Hence, the function tt h-> P(w,t>) 1 / a — or, equivalently, w >-)■ C a (u,v) — uv is positive in 
a neighbourhood to the left of n = 1. This establishes the property formulated under the 
bullet above. 

Next, in view of the equation 

Cav[U,r v (Y)]=C a {G(y)), (4.6) 

we construct Y such that its support is in an interval [v*, v**] C (0, 1) and 



• for every v G [v*, v**], we have C a (v) > 0, thus assuring that the PQDE property holds, 
provided that Y takes only on values in the interval [v*, v**]. 

The construction of the aforementioned Y is as follows. We choose a set of K points Vk > v* 
such that Ylk=i v k < v ** an d ^2k=i v k = 1- Define the cdf G by the formula 

K 

G (y) = ^v k l{y k < y}, 

k=l 

where yx < y% < ■ ■ • < Vk are real numbers. In other words, the random variable Y takes on 
the values y k with the probabilities v k . Note that the range of the cdf G is the set {0, v±, V\ + 
u 2, • • • , Ylk=i v k, !}• % construction, C a (i>) > for all v G {i>i,t>i + u 2 , • • • , Y.k=i v k}- 
Hence, in order to verify that C a (G(Y)) > for all real y G R, we are only left to check that 
C a (v) > for v G {0, 1}, but this holds because C a (u, 0) = and C a (u, 1) = u. 

In summary, we have constructed a pair (U,Y) such that Cov[E/,Tj,(y)] > for all 
y G R, that is, U{= X) is PQDE on Y, but the pair is not QD, that is, it is neither PQD 
nor NQD. This concludes Example 14.31 



5 QDE-based Griiss-type covariance bounds 

From the previous two sections we know that the set of QDE random pairs is larger than 
the set of QD pairs. In this sense, establishing Griiss-type covariance bounds under the 
QDE assumption would be an extension of those established under the QD assumption. We 
explore such QDE-based results in the current section. In what follows, we use the notation 

A k [Z] = {E[\Z-E[Z}\ k ]) 1/k . 

The next theorem, whose proof is a consequence of equation (13.31) . utilizes the QDE notion 
and establishes a sharper bound than Griiss's original bound (11. 3p . 

Theorem 5.1 For every pair p, q G (1, oo) such that p~ l + g" 1 = 1, we have the bound 

|Cov[X,/3(Y)]| <D p [X,Y]G p [X,Y,/3], (5.1) 

where the QDE-based dependence coefficient is 

n tv vl |Cov[X,T,(r)]| 

with the supremum taken over all y G R such that G(y) G (0, 1), that is, over the support of 
the random variable Y , and where the QDE-based Griiss factor is 

G P [X,Y,P] = A P [X] [ A q [T y (Y)]d\p\(y). 



Before discussing properties of the QDE-based dependence coefficient and Grass's factor, 
we first show that bound (15. ip implies Grass's bound (If .3p . 



Statement 5.1 Setting p = 2 and (3(x) = /3q(x) = x, we have that under the Griiss condi- 
tions on X and Y , Griiss 's bound $1.3\) follows from Theorem \5.1\ 

Proof. Since D2[X, Y] < 1, we only need to show that 

w, M < {A - a) l B - b) - ("J 

Since G(y)(l — G(y)) does not exceed 1/4 and is equal to outside the support of Y, and since 
Y G [b, B] C R almost surely, we have that G2[X, Y, j3 ] does not exceed y/Var[X] (B — b)/2. 
Furthermore, since X G [a, A] C R almost surely, then (see, e.g., Zitikis [38], p. 16) A/Var[X] 
does not exceed (A — a)/2. Hence, bound (15.2p holds. ■ 

The dependence coefficient D p [X, Y] is always in the interval [0, 1]. It takes on the value 
when X and Y are independent. Furthermore, the coefficient achieves its upper bound 1, 
as seen from the following statement. 

Statement 5.2 The dependence coefficient D p [X,Y] achieves its upper bound 1. 
Proof. Given Y, let X be the random variable X Q defined by the equation 
*o = e\r yo (Y) - E[r yo (Y)}\ 9/p sign(r w (Y) - E[r yo (Y)]), 

where 

• the number y > is any but fixed, and 

• the random variable e, independent of Y, takes on the two values ±1 with same 
probabilities p = 1/2. 

The expectation E[X ] is equal to 0. The absolute value |X | is equal to ^(V)— EfT^Y)] \ 9 P 
The covariance Cov[X , T y (Y)} is equal to A^[r 3/ (F)]. Hence, D p [X 0; Y] — 1. ■ 

The magnitude of the coefficient D p [X, Y] depends on the dependence structure between 
X and Y, as well as on the marginal cdf 's of the two random variables. For example, in the 
case of independent X and Y, we have D P [X, Y] = 0. Less trivial and thus more interesting 
examples follow. 

Example 5.1 Consider the convex combination of the lower and upper Frechet copulas as 
defined by equation (14.11) . Both X = U and Y = V have uniform distributions on the 
interval [0, 1], and thus 

Ap[U]= 2(v+1)Vp and A,[t u (V)] = «;/«(«), (5.3) 



where the function K q : [0, 1] — > [0, 1/2] is defined by 



K q (v) = v(l - v) q + (1 - v)v q 



Furthermore, since Cov\U,t v {V)] = C a (v) with C a (v) given by equation (I4.2p . we have that 

v (1 — v) 



B p [U,V} = 2(p + iy/P 



1 

a ~2 



sup 



0<v<l K q /q (v) 

Recall that q = p/(p — 1). The QDE-based Griiss factor is 

1 



(5.4) 



G p [U,V,0\ 



Kl /q (v)d\/3\(v). 



2(p + iy/P J0 

In the special case when p = 2 and (3(v) = (3q(v) = v, we have that 

D 2 [U,V] = V3 
G 2 [U,V,Po]-. 
Bound (15.11) therefore implies that 



(5.5) 



1 

i 



7T 



9 AT / \/v{l-v)dv^ 

lyo Jo loyo 



(5.6) 
(5.7) 
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w 0.19635 
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~ 16 
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2 



(5.J 



whereas the exact calculation using, for example, the equation Cov[X, Y] = C a (v)dv and 
formula ( 14. 2 p gives the value 



Cov[X, y] = i U - M ps 0.166667 (a - ^ ) 



(5.9) 



This concludes Example 15.11 



Example 5.2 Consider the convex combination of the Farlie-Gumbel-Morgenstern and upper- 
Frechet copulas as defined by equation (14. 3p . Just like in the previous example, both X — U 
and y = V have uniform distributions on the interval [0, 1]. Thus, equations (15. 3p hold in 
the current case as well. The covariance Cov[U, t v (V)\ is, however, different: it is equal to 
C a {v) given by equation (I4.5p . Hence, the QDE-based dependence coefficient is 

v(l — v) 
sup — —, , 

0<f<l K g /q (v) 

Recall that q = p/{p — 1). Note that the QDE-based Griiss factor G P [U, V, 0\ is unaffected by 
the change of the dependence structure and therefore has the same expression as in previous 
Example 15.11 (see eq. (15.51) ). In the special case when p = 2, from the above formulas we 
have that 



D p [U,V] = (p+l 



-a 

3 3 



(5.10) 



D 2 [c/,y] 



v/3 



— a 

3 3 



(5.11) 



Bound (15.11) therefore implies that 



|Cov[X,F]| < 



7T 



32 



4 1 
— a 

3 3 



0.0981748 



4 1 

— a 

3 3 



(5.12) 



whereas the exact calculation using the equation Cov[X, Y] = C a (v)dv and formula (14.51) 
gives the value 



Cov[X, Y] 



1 /4 1 

— —a 

12 V 3 3 



,'4 1 
0.0833333 -a - - 



(5.13) 



This concludes Example 



6 Estimating the QDE-based Griiss factor 

In Example 15. II we calculated the QDE-based Griiss factor G P [U, V, f$\ in the case of uniform 
random variables U and V. In Statement 15 . 1 1 we estimated G2[X, Y, (3 ] under the Griiss con- 
dition on X and Y. In the current section we develop general results that aid in establishing 
tight upper bounds for the QDE-based (general) Griiss factor G P [X, Y, /?]. Specifically, upon 
expressing the quantities A P [X] and A g [r y (F)] by the formulas 

A P [X] = (E[|X - ^}) 1/P 

and 

\[r y (Y)} = «;/«(G(y)), 
where the mean fi = E[X], the cdf G of Y, and the function 

K q (x) = x(l — x) q + (1 — x)x q , 

we see that estimating G P [X, Y, (3] relies on tight bounds for the p th central moment E [\X — 
/x| p ] as well as on the function K q (x). Interestingly, as we shall see from Theorem 16.11 below, 
which is the main result of this section, tight upper bounds for the p th central moment also 
crucially rely on the function k p {x). 

Theorem 6.1 Let X be a random variable with support in [a, A]. Then, for every p > 1, 
we have that 

E[\X-^] <(A-aYK p (^\ (6.1) 
Consequently, with the notation K p = sup xg [ x j k p {x), we have that 

E[\X - p\ p ] <{A-a) p K p . (6.2) 

Furthermore, when p — > oo, we have that 

(l+p)K p ^e-\ (6.3) 



The maximum K p of the function achieved at a unique point x = x p in the interval 

[1/(1 +p), 1/2] and thus, by symmetry, also at the point 1 — x v in the interval [l/2,p/(l+p)]. 
The point x p is such that, when p — > oo, 

(l+p)x p ^l. (6.4) 

Note 6.1 Bound ( 16. ip is sharp in the sense that there is a random variable X = X\ for 
which the inequality turns into an equality. Namely, let X\ take on only two values, a and 
A, with the probabilities P[Xi = a] = (A — fi)/{A — a) and P[Xi = A] = (fi — a)/(A — a), 
respectively Inequality ( 16. lip becomes an equality. 

Note 6.2 When p — 2, then K p = 1/4, which plays a crucial role in deriving the Griiss 
bound. Formulas for K p for the integers 1 < p < 6 are given in Table 16.11 along with the 
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Xp 


K p 


1 


I = 0.50000000000000000000 


I = 0.500000000000000000000 


2 


1 = 0.50000000000000000000 


\ = 0.250000000000000000000 


3 


\ = 0.50000000000000000000 


| = 0.125000000000000000000 


4 


3 ~y^ = 0.21132486540518711775 

D 


i = 0.083333333333333333333 


5 


s-Ve^To-is = Q.16776573020222127904 


5^-14 _ 0.067088455586736913333 


6 


15-^60^10-75 _ 0.14294933504534875025 


4 v^- 5 = 0.056660078819803832059 

ld5 


7 


0.12500637707104845945 


0.049087405277751670707 


8 


0.11111148199402853664 


0.043304947663997051030 


9 


0.10000001858448876931 


0.038742049800000743380 


10 


0.09090909172727279935 


0.035049389983188641270 



Table 6.1: The values of x p and K p for the integers 1 < p < 10. 

corresponding values of x p G [0,1/2]. By symmetry, the maximum K p is also achieved at 
the point 1 — x p G [1/2, 1]. However, throughout this paper we use x p to denote the only 
existing point in the interval [0, 1/2] such that 

K p K p (^X p ^j . 

(We refer to the proof of Theorem 16.11 for the existence and uniqueness of x p .) When p 
becomes large, expressions for x p and K p become unwieldy, due to the fact that x p is a 
certain solution to a polynomial equation of a high degree, for which explicit solutions are 
not known to the best of our knowledge. For this reason, in Table I6TT1 we have provided only 
numerical values of x p and K p for the integers 7 < p < 10. 



Note 6.3 In the proof of Theorem 16. II we shall establish lower and upper bounds for x p and 
K p . Namely, we shall show that 



p+l\l+pj ~ p ~p+l\l+pJ 2 1+ p 



" ^ <K r <^U-\ + _> (6.5) 



and 



1 



D P = 2/(l + J P —^ ). (6.7) 



^ = - when 1 < p < 3, 

7 <x P < 2 i (6.6) 

P + 1 < D p when p > 3, 

where 

D„ = 2 / II 4- J 

Note that _D P — >■ 1 when p — > oo. Bounds (16.61) imply that x p ~ l/(p + 1) when p — > oo. 

We next present a few results under additional assumptions on X. For example, if we 
have more precise information about the location of the mean \i than just fi £ [a, A] (see, 
e.g., Zitikis [38J for a related discussion), then the following corollary to Theorem 16. II holds. 

Corollary 6.1 Let X be a random variable with support in [a, A], and let [a*, A*} be a sub- 
interval of [a, A] such that fi £ [a*, A*]. Then for every p > 1 we have that 



E[\X-^] < (A -ay max ^(x) : xe ^ J. 

Proof. Bound (16. 8 p follows from equation ( 16 . 1 0[) and the bound (cf. bound (16. ip ) 

E[|y - m?] < //(i - v'T + /^ /p (i - mO = «p(mO, 



(6.8) 



where 



, u — a 

^ = ■7 e 

A — a 



a A* — a 



A — a A — a 



This concludes the proof of Corollary 16.11 



In some situations the random variable X might be symmetric, in which case we have 
the following proposition. 

Proposition 6.1 Let X be symmetric with support in [a, A]. Then, for every p > 1, 

E[\X-^)< { -^^. (6.9) 

Proof. Denote Y = (X — a) /{A — a). The random variable Y has support in [0, 1] and its 
mean is // = (/i — a) /(A — a). Hence, 

fc^l = E[|y-,r]. (6.10) 



Since Y is symmetric (around its mean //), and the mean p! is in the interval [0, 1], we have 
that \Y — p!\ does not exceed 1/2. Hence, the right-hand side of equation (I6.10p does not 
exceed 1/2 P . This concludes the proof of Proposition 16.11 ■ 

Proof of Theorem 16.11 Edmundson [11] proved that if / is a convex function and X is 
a random variable with support in [a, A], then 

E[/P0] < ^/(o) + XT^)> ( 6 - n ) 



where /i is the mean of X. This result was subsequently extended by Madansky [30] and is 
nowadays known as the Edmundson-Madansky inequality. Since the function f(x) = \x — p\ p 
is convex for every p > 1, the Edmundson-Madansky inequality gives bound (16. ip . Bound 
(16. 2p follows trivially. Statement (16. 3p follows from the fact that (p/(l +p)) p — > e" 1 and the 
bound 

1 < P y<K r <J-(-L-)' + -L, (6.12, 



p+lVi + p/ ~~ p_ j>+lvl+P/ 2 1+ p 

which holds for every p > 1 as we shall next prove. We start with the upper bound and 
show that, for every x G [0, 1], 

Since K p (x) = K p (l — x), we only need to check bound ( 16.131) for x G [0,1/2]. With the 
notation h(x) = x(l — x) p we have that 

k p (x) = h{x) + h{\ — x). 

On the interval [0, 1/2], the function h(x) achieves its maximum at the point x — l/(p+ 1), 
and so we have that 

h(x) < (t^—) ■ (6.14) 



p+ l V 1 +p 

On the other hand, the function /i(l — x) is increasing on the interval [0,1/2], and so it 
achieves its maximum at the point x = 1/2, thus giving the bound 

Kl-x)<^. (6.15) 

Adding up bounds (16.1411 and ( 16 . 151) . we obtain bound (16.131) . 

To establish the lower bound of ( I6.12p . we first note that K p (x) > h(x), and thus K p > 
h(x). Since the function h(x) achieves its maximum on the interval [0,1/2] at the point 
x = l/(p + l), bound K p > h(x) implies the lower bound of (16.13p . This completes the proof 
of the two bounds of ( 16. 1 2f) . 

To prove that (1 + p)x p — > 1 when p — > oo, we show that 

<x p < z—^Dp, (6.16) 



p+1 ~ P ~ p+1 



where D p is defined by equation (16. 7\i . The lower bound of (I6.16P is obvious when p = 1. 
Hence, from now on we consider the case p > 1. We want to find those x G [0, 1/2] that 
maximize the function k p (x). We check that k,' (x) = holds if and only if 

\l — x J p — (1 + pjx 

The left-hand side of equation (16. 17ft is non- negative for all x G [0, 1/2], whereas the right- 
hand side is non-negative only when x > l/(p + 1). This implies that if we find a point 
x G [0,1/2] such that the function n p (x) is maximized, then the point must be such that 
x > l/(p+ 1), thus implying the left-hand bound of (16.161) . 

Both sides of equation ( 16. 1 Tf) are increasing functions on the interval [l/(p + 1), 1/2]. 
Both sides are equal to 1 at the end point x = 1/2. Hence, the existence and uniqueness 
of x p G [l/(p + 1), 1/2) is equivalent to showing that the two functions intersect only once 
on the interval [l/(p + 1), 1/2), but if they do not intersect, then we have x p = 1/2. We 
determine this by checking if the ratio 

R(x)= {1+ P )x - 1 /(-^-Y' 1 (6.18) 



p — (1 + p)xl \l — x / 

crosses the horizontal line {(x, 1) : x G [l/(p + 1), 1/2]} only once, provided that it crosses 
at all. Note that R(l/(p+ 1)) = and _R(l/2) = 1. Furthermore, the derivative R'(x) is 
always positive on the interval [l/(p + 1), 1/2) when 1 < p < 3. Hence, the function R(x) 
achieves its maximum at the end point x = 1/2, implying that x p = 1/2 when 1 < p < 3. 
When p > 3, then the derivative R'(x) is positive on [l/(p+ l),x*), negative on (x*, 1/2), 
and vanishes at x = x*, where 



Since R(l/(p + 1)) = and i?(l/2) = 1, we therefore conclude that when p > 3, then the 
function R(x) is increasing on the interval [1 / (p+ 1), x*} with the initial value R(l/(p + 1)) = 
0, and then, once it reaches its maximum at the point x*, becomes decreasing on the interval 
[x* 1/2] with the final value i?(l/2) = 1. Since the final value is 1, we conclude that the 
function R(x) has crossed the horizontal line {(x, 1) : x G [l/(p + 1), 1/2]} exactly once. The 
crossing point is x p because it maximizes the function k p (x). This proves Theorem 16 .![ ■ 



7 Regression-based covariance bounds 



In this section, we look at the covariance Cov[X, /3(V)] from a slightly different angle. First, 
we write the equation 

Cov[X, /3(F)] = CovKF), /3(F)], (7.1) 



where 

a(y) = r x \y{y) = E[X\Y = y] - E[X], 

which is the centered regression function. The right-hand side of equation (17. ip complicates 
the problem by introducing an additional distortion function but it also simplifies the problem 
by reducing the pair (X,Y) to (Y,Y). Nevertheless, the following theorem, whose proof is 
based on an application of Holder's inequality on the right-hand side of equation (17. ip . 
offers a sharper bound than Griiss's bound (jl.3p by utilizing a regression-based dependence 
coefficient 

A ix y\ - 

Ap[X ' F] - A P [X] ■ 
We shall discuss properties of the coefficient later in this section. 

Theorem 7.1 For every pair p, q G (1, oo) such that p~ l + g" 1 = 1, we have the bound 

\Cov[X,(3(Y)]\ < A p [X,Y]T p [X,Y,f3], (7.2) 

where 

T p [X,Y,f3] = A p [X]A q [f3(y)} (7-3) 
is the regression-based Griiss factor of the bound. 

We next show that bound (17. 2ft implies Griiss's bound (11.31) . 

Statement 7.1 Setting p = 2 and (3(x) = (3q(x) = x, we have that under the Griiss condi- 
tions on X and Y , Griiss's bound U.3\) follows from Theorem\7.1 



Proof. Since ^[X, Y] < 1, we only need to show that 



UX,Y M < {A - a)< : B - b) . (7.4) 



Since X e [a, A] C R almost surely, then (see, e.g., Zitikis [38], P- 16) A 2 [X] = A/Var[X] 
does not exceed (A — a)/2. Likewise, when Y G [b, B] C R almost surely, then A2[/3(F)] = 



y/Var[Y] does not exceed (B - b)/2. Bound ([73]) follows. ■ 

We next discuss properties of the regression-based dependence coefficient A P [X, Y}. Note 
first that the coefficient is always in the interval [0, 1]. Furthermore, when X and Y are 
independent, then A P [X, Y] — 0, and when X = Y almost surely, then A P [X, Y] — 1. 

When the pair (X, Y) follows the bivariate normal distribution, then the centered regres- 
sion function takes on the form 

^M = ^p(y-Ep1), (7.5) 



and we therefore have the equation 

A p [X,Y] 



Cov[X,F] A p [Y] 
Var[F] A P [X] 



In particular, when p = 2, since A 2 [X] = v/Var[X] and A 2 [/3(Y)} = y/Var[Y], we have 
that the regression-based dependence coefficient D 2 [X, Y] is equal to the Pearson correlation 
coefficient Corr[X, Y]. Furthermore, an application of equation (17. 5p on the right-hand side 
of equation (17. ip gives equation (j2.2j) . which, assuming that (3 is differentiable, in turn gives 
equation (12. ip . 
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